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Abstract 



Consider the complete n-vertex graph whose edge-lengths are independent exponentially 
distributed random variables. Simultaneously for each pair of vertices, put a constant flow 

between them along the shortest path. Each edge gets some random total flow. In the n — s- oo 
limit we find explicitly the empirical distribution of these edge-fiows, suitably normalized. 
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1 Introduction 



Write network for an undirected grapli wliose edges e liave positive real edge-lengtlis i{e). In a 
n-vertex connected networlc, tlie distance D(i,j) between vertices i and j is tlie lengtli of tlie 
sliortest route between tliem. Assuming generic edge-lengtlis, tlie shortest route is unique. For 
each ordered (source, destination) pair of vertices (i, j), send flow of volume 1/n along the shortest 
route from i to j. (The normalization 1/n is arbitrary but convenient for ([T]) below). For each 
directed edge e (i.e. an edge e and a specified direction across e) of the network, let /(e) be the 
total flow across the edge in that direction. Note 

n-' E /(e)£(e) = n-2^^I)(z,j):=5 (1) 

directed 6 i j 

where D is the average vertex-vertex distance. 

One can formulate a project to study the distribution of such edge-flows /(e) in different models 
of random n-vertex networks. Such models include both deterministic graphs to which random 
edge-lengths are assigned, and random graphs of both the classical Erdos - Renyi or random 
regular type [8 and the more recent complex networks types PQ [TUl [13 EI] again with real edge- 
lengths attached. As implies, this project is a refinement of the project of studying EZ)„, so 
we envisage a model sufficiently tractable that we know 

EDn = (1 + o(l))4 (2) 

for some explicit (dn). 

To set up some notation, return to the setting of a deterministic network. Because we are using 
shortest-path routing, we expect edges-flows to be correlated with edge-lengths, so let us study 
jointly edge-flows and edge-lengths by considering the empirical measure tp^ which puts weight 
1/n on each point {i{e), /(e)): 

V^0(.,.):=^ l{(^(e),/(e))G (.,•)}. 

directed 6 

So (III) becomes 

£u i;^{dl,du) = D. 

So when short edge-lengths are order 1 we should normalize edge- flows by dm that is consider 
the measure 

M;-)--=l E l{We),/(e)R)G (•,•)}. (3) 

directed 6 

For a random network, Z)„ is a random variable and V'n("5 ■) is a random measure, related by 

lyMd^,dy) = ^. (4) 

dn 

This notation is designed to suggest possible n — > oo limit behavior; that the random measures V'n 
converge to a non-random measure V' which by ^ and under appropriate uniform integrability 
conditions must satisfy 

ly^{dl,dy) = l. (5) 



The purpose of this paper is to prove this result and identify ip in one particular model, described 
in the next section. There is a fairly simple heuristic argument to identify -0, shown in section 
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1.3 The heuristic argument yields predictions for the limit in many "locally tree-like" models, 
as discussed in section 4.4 However for the proofs in this paper we exploit special structure of 



our model, and it seems technically challenging to find rigorous proofs in the broader settings of 
section 14.41 



1.1 The complete graph with random edge-lengths 



Our probability model for a random n-vertex network starts with the complete graph and assigns 
independent Exponential(rate 1/n) random lengths Lij = Lji = Lg to the (2) edges e = 
This model (which we denote by Qn) and minor variants (uniform(0, 1) lengths; complete bipartite 
graph) have been studied in various contexts, for instance the length of minimum spanning tree 
[TT] . Steiner tree [9], minimum matching |3l HH HSl [20] and traveling salesman tour [T71 El |27] . 
Note that our scaling convention ELg = n makes lengths a factor n larger than in most of the 
earlier literature. Most closely related to the present paper is the work of Janson [12] and van der 
Hofstad et al. [25| who studied several aspects of the distances Dn{i,j)- see also Wastlund [26] for 
connections with minimum matching. In particular, it is known (13) that EZ)„ = (1 + o(l)) logn 
so that we use dn '■= logn to scale edge-flows. 



1.2 The main result 



To fix notation, the vertex-set is [n] := {l,2,...,n}. All quantities in the n-vertex model Qn 
depend on n; our notation makes n explicit only where helpful. For each ordered pair write 
7r(i, j) for the shortest path (considered here as a set of directed edges e) from i to j. Define 

Pn{e)='-Yl HeeniiJ)} (6) 

ie[n] je[n],j^i 

SO that Fn{e) is the total flow across the directed edge e in the specified direction, when a flow 
of volume 1/n is put along the shortest path between each ordered vertex pair. Write 7^ for 
cardinality. 



Theorem 1 As n — > 00 for fixed z > 

POO 

^#{e:Fn{e)>zlogn}^L^G{z):= F{WiW2e-^ > z) du (7) 

Jo 

where Wi and W2 are independent Exponential(\ ). In particular 

^E#{e:F„(e)>zlogn}^G(z). (8) 

In more detail, for dn := logn the random empirical measure ipn at ^ converges to the non- 
random measure ijj which is the "distribution" of{Uoo, WiW2e~^°°) when Uoo is uniform on (0, 00) 
and independent of (Wi, W2)- 



In the final assertion we wrote "distribution" because ifj \s a cr-finite distribution. As explained in 
section 2.9, "convergence of ^„" means l} convergence over the vague topology. The appearance 



of a (T-finite limit is not surprising, because edges of fixed large length carry a flow which is small 
but non-negligible compared to flow across edges of length 1. Note the anticipated identity ([5]) 
holds because Jq°° ue"" du = 1. Note also that the scaling of edge-lengths in Qyi, doGS not affGct 
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the conclusions (7]|8) which remain true if edge-lengths have Exponential 1) or Uniform(0, 1) 
distribution. 



See section 4.1 for further discussion of the function G{z). In particular its tail behavior is a 
stretched exponential 

G{z) = exp(-zH°(i)) as z ^ oo (9) 



rather than an ordinary exponential as one might have guessed. Section 4.3 states the analog of 
Theorem [T] for the distribution of flows through vertices instead of edges. 



1.3 A heuristic argument 

Here is a heuristic argument for why the limit is this particular function G{z). Consider a short 
edge e, that is an edge of length 0(1). Suppose there are W^{t) vertices within a fixed large 
distance r of one end of e, and W'J{t) vertices within distance r of the other end. A shortest- 
length path between distant vertices which passes through e must enter and exit the region above 
via some pair of vertices in the sets above (see Figure 2), and there are W^{t)W'^{t) such pairs. 
The dependence on the length Le is more subtle. By the Yule process approximation (Lemma 
[3]) the number of vertices within distance r of an initial vertex grows as e*", and it turns out that 
the flow through e depends on Le as exp(— Le) because of the availability of alternate possible 
shortest paths. So flow through e should be proportional to VFg(r)l^g'(r) exp(— Lg). But (again 
by the Yule process approximation, Lemma [3| for large r we have e~'^W^{T) has approximately 
an Exponential 1) distribution Wi. And as n — > 00 the normalized distribution n^^^{e : Lg G •} 
over directed edges converges to the u-finite distribution of Uoo- This is heuristically how the 
limit joint distribution (C/00, W^iW^2 exp(— C/qo)) arises. 

Our proof of Theorem [T]is essentially just a formahzation of the heuristic argument using explicit 
calculations exploiting the special structure of our random network model. But we exploit a 
variety of tools to handle the details. For proving (section |2]) the "expectation" assertion ([s]) the 
key idea is 

• analyzing the behavior of the percolation (flow from vertex 1) process in a given neighbor- 



hood (sections |2.5| and 2.6 1 

but we also use 



the Yule process local approximation (section 2.3) 



a martingale property (section 2.2 1 



a general weak law of large numbers for local functions on Qn (section 2.8) 



For proving (section [s]) the convergence assertion ([t]), we need to study the joint behavior 
of two shortest paths 7r(l, 2), 7r(3, 4). This involves somewhat intricate conditioning arguments. 
The key ideas are 

• finite-n bounds for mean intensities of short paths (section |3.1[ ) 

• the size- biased Yule process (section |3.4[) 
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conditional on existence of a given short path from vertex 1, the process of numbers of 
vertices within distance t from vertex 1 grows as a size-biased Yule process (section 3.4). 



2 Proofs 



2.1 Preliminaries 



Exponential(A) and Geometric(p) denote the exponential and geometric distributions in their 
usual parametrizations. 

Here we collect without proof some standard properties of the random network model Qn- For 
fixed n and for t > define 

Nn{t) := number of vertices within distance t from vertex 1 (10) 
where we include vertex 1 itself; 

Sn,k •= min{t : Nn{t) = k}, 1 < k < n — 1 
so that Sn,k+i is the distance from vertex 1 to the /c'th nearest distinct vertex. Then 

{Sn,k+i — Sn,k, I < k < n — 1) are independent Exponential ^^"^ ) . (11) 

Because the distance D{1,2) is distributed as Sn,v for ^ uniform on {2,3, . . . ,n}, it is straight- 
forward (see e.g. |13|[25] for similar calculations) to use (11 ) to write exact formulas for the mean, 
variance and generating function of D(l, 2) and then deduce the n ^ oo limit behavior 

EL>(1,2) -logn ^ ci, var L>(1,2) ^ C2, ^(1, 2) - log n ^ Doo (12) 



for finite constants ci,C2 and a distribution Dqo discussed further in section 4.6 Note that the 



average vertex-vertex distance Dn at ([1| has the same mean, but not the same distribution, as 
D{1,2), so 

EDn = logn + 0{l) asn^oo. (13) 



There is a natural mental picture of (first passage) percolation, in which at time there is water 
at vertex 1 only, and the water spreads along edges at speed 1. So Nn{t) vertices have been 
wetted by time t. Each vertex j ^ 1 is first wetted via some edge and this collection 

of directed edges forms the percolation tree rooted at ver tex 1. The "flow" in Theorem [l]fr om 
vertex 1 goes along the edges of this percolation tree, and no other edges. 

Associated with the percolation process is a filtration {J-'t), where J-t is the information known 
at time t, illustrated informally as follows. Write Tj for the wetting time of vertex i. Then the 
values of Tj for which Ti < t are in J^f On the event {Tj < t < Tj}, the information in J^t about 
Lij is that Lij > t — Ti. By the memory less property of the exponential distribution, on the event 
above the conditional distribution of Lij — {t — Ti) given J-^ is Exponential 1/n), and the {Lij) are 
conditionally independent given J^f More elaborate versions of this memoryless property appear 
in Lemmas [7] and m 

An obvious consequence of the Exponential 1/n) distribution of edge-lengths Lij is that 

nP(Ljj < t) ^ T as n ^ oo. 
In words, this says that the measure nF{Lij G •) converges vaguely to Lebesgue measure on (0, oo). 
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2.2 A martingale property 



In the percolation process above, write Wn(t) for the set of vertices wetted by time t. The 
following martingale property turns out to be useful. 



Lemma 2 Let Wn be a stopping time for the percolation process on Qn- For each v £ >V„(VF'„) let 
Y{v) be the number of vertices j G [n] such that, in the shortest path from 1 to j, the last-visited 
vertex ofWn{Wn) is vertex v. Then 



Proof. Define Y{v, t) as Y{v) but counting only vertices j which are wetted by time t. As t 
increases, whenever a new vertex j is wetted via some edge the predecessor vertex 

is a uniform random element of W{t—). It follows easily that the process jfjj^, t > Wn is a 
martingale. The optional sampling theorem shows 



E 



Y{v, oo) 



iV„(oo) 



1 



Y{v,Wn) ^ 

Nn{Wn) Nn{Wn) ' 



But 



Y(v,oo) _ Y(v) 
N„{oo) ~ n 



2.3 The Yule process approximation 



The Yule process {Noo{t), < t < oo) is the population at time t in the continuous-time branching 
process started with one individual, in which individuals live forever and produce offspring at the 
times of a Poisson(l) process. Writing 

5oo,fc := min{t : iVoo(t) = k} 

we have 

{Soc,k+i — 5*00, fc, I < k < oo) are independent Exponential(A;) (14) 



and so the Yule process is the natural n ^ oo limit of the process (11) associated with the 
percolation tree. We quote some standard facts about the Yule process. 



Lemma 3 (a) Noo{t) has Geometric(e~^ ) distribution. 

(b) e~^Noo{t) is a martingale which is bounded in L^, and e~^Noo{t) — > W a.s. and in as 
t — > oo, where W has Exponential(\ ) distribution. 



It is intuitively clear that that the local structure of Gn relative to one vertex converges to the 
Yule process. Abstractly {6], we call this notion of convergence of random graphs local weak 
convergence and the limit structure (the Yule process regarded as a "spatial" graph) is called 
the PWIT. But rather than work abstractly we will state only the more concrete consequences 
needed, such as the next lemma. 



Lemma 4 Fix k >1 and t < oo. For 1 < i < k let Nn\t) be the number of vertices ofQn within 
distance t from vertex i. Then as n ^ oo 

(Ar(i)(t),...,ArW(t)) 4 {N(^\t),...,N^\t)) 

where the limits N!^{t) are independent Geometric{e'~*) . 



6 



Proof. For k = 1 this follows from ( 11|14 ) and Lemma |3|^a). For general k, use the natural 
conditioning argument. ■ 

The following technical lemma shows one way in which the "exponential growth with rate 1" 
property of the Yule process (Lemma |3|b)) translates to the percolation process. 

Lemma 5 Let Wn be a randomized stopping time for the percolation process on Qn- Fix e > 
0, (J < oo and a sequence such that u;„ — > oo with cOn < n^/^. Then as n ^ oo 



1 - e < . < 1 + e 



e-Nn{Wn 



uniformly on {ujn < Nn{Wn) < n/ujn}- 



Proof. It is enough to show this holds conditionally on N(Wn), that is to show 

lp(l-£< ^"[^y < l+£ Wn=tn,Nn{Wn) = kn^ ^1 (15) 

whenever kn oo, n/kn oo. Note that the value of tn does not affect the conditional proba- 
bility. 



First note that by ( lTp4 ) we can couple A'^„(-) and A^oo(-) by constructing each from the same 
i.i.d. Exponential 1) sequence (Yi) via 



Soo,k = ^ jYf, Noo{t) = max{k : Soo,k < t} 



i=l 
k-l 



Sn,k = ^ if^^i'^ ^n{t) = max{A: : Sn,k < t}. 



i{n—i) 
i=l 

/,From this coupling we see that for kn < rrin with fc„ — > cxd, n/rrin — > oo we have 

Sn,mn ^n,k„ 



s, 



oo,m„ 5*00, fc' 



1 a.s. 



(16) 



(17) 



By the homogeneous branching property of the Yule process and Lemma |3]^a), conditional on 
{Nooitn) = kn}, we can represent Noo{tn + cr) as the sum of kn independent Geometric(e~'^) r.v.'s, 
and so (still conditionally) ^°°fc^e"°^'^'* — > 1 in probability. In terms of (5'oo,fc) this is equivalent to 
the (now unconditional) property that 



if kn ^ oo, m„ ~ knc"^ then 5oo,m„ - <S'oo,fc„ 



a. 



Now by (17) and assumptions on (/c„) we see 



if kn oo, run^ knc" then Sn,m„ - Sn,k„ cr. 
This holds for each fixed o" > 0. Translating this back into an assertion about {Nn{-)) establishes 



(15). 



Recall that D{i,j) denotes vertex-vertex distances in Qn- 

Lemma 6 For disjoint subsets B,C C [n] and for any d > 0, 

E#{{i,j) eBxC: D(i,j) <d]< #B#Ce''/{n - 1). 



7 



R(t)( vertices which are within t from vn) 

L(t) ( vertices which are within r from vn) 

Figure 1: Neighborhood about an edge 



Proof. By linearity and symmetry we reduce to the case where B and C are singletons. Then 
the left side equals PCp(l,2) < d) = {n - l)~^(EA^„(d) - 1). By the coupling ^ to the Yule 
process, and Lemmak3[a), ENn{d) < ENoo{d) = e^. ■ 



2.4 Local structure in the n- vertex model 

In this section we give a result (Lemma [7]) describing the global structure of Qn conditional on 
a given local structure. The actual result is obvious once stated, but requires some notational 
effort to set up. 

Fix a real r > 0. Let t be a finite unlabelled tree with edge lengths, with the following properties 
(see Figure 1). 

(i) There is a distinguished directed edge, whose end vertices can then be labelled as {vL,vji), 
defining a partition of all the vertices of t as V{t) = L(t) U i?(t). Here L and R are mnemonics 
for left and right, and the distinguished edge is directed left-to-right. 

(ii) Every vertex in L(t) is within distance r from vl, and every vertex in R{t) is within distance 
T from vr. 



Write T,- for the set of such trees t. For such t, write i{e) for the length of the distinguished 
edge e. And for each vertex v £ L{t) write b{v) = r — D{v,vl) as the "distance to boundary" 
(as in Figure 1, we envisage a boundary drawn at distance at distance r from vl and from vji). 
Similarly for v G -R(t) write b{v) = t — D{v, vji). Finally, given t and given a subset A C [n] with 
= ^V{t), let tyi denote some labelling of the vertices of t by distinct labels from A. 
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Now consider the random network Qn- We occasionally want to regard an edge of Qn as 
a point-set, so that a point on the edge is at some distance < u < Lij from vertex i and 
at distance Lij — u from vertex j, with this notion of distance extending in the natural way to 
distances between a point on an edge and a distant vertex. Fix r and vertices vl,vr E [n]. Define 
the "neighborhood" Mriv^jV^i) as the subgraph of Qn whose vertex-set consists of vertices v for 
which mm{D{v,vi,), D{v,vr)) < r. Its edge-set is the subset of edges of Gn such that, for every 
point along the edge, the distance to the closer of {vL,vji} is at most r. It also contains by fiat 
the distinguished directed edge {vl,vr). 

In general Nt{vl, vr) need not be a tree, but if it is a tree then clearly it is a tree of the form 
for some t G T,- and some AC [n] . In this case we can define h{i) (meaning distance to boundary 
of neighborhood) as above for vertices i oi Mt{vl,vr), and we define h{i) = for other vertices 
of Qn- 

Lemma 7 Fix n,T,VL,VR andtA- Then conditional on Mr{vL,vji) = t^, the lengths (Lij) of the 
edges of Qn which are not edges of Mt{vl,vr) are independent r.v.'s for which Lij — h{i) — b{j) 
has Exponential(\ / n) distribution. 

Proof. Saying that («, j) is not an edge oi Mt{vl-,vr) is saying that Lij > h{i) + b{j). The edge- 
lengths {Lij) are a priori independent Exponential(l/n), and conditioning on all these inequalities 
leaves them independent with the stated distributions. 

2.5 The percolation tree on a neighborhood 

We now come to the central idea of the proof, which is to study how the percolation tree behaves 
on a given neighborhood. Until further notice we adopt the setting of Lemma [7] and work con- 
ditionally on Mr{vL,VR) = tA- Let E,- and Pr denote respectively the conditional expectation 
and conditional probability operations; and we will use tilde notation Lij to denote uncondi- 
tioned quantities. Thus the conclusion of Lemma [7] can be rewritten as follows. Starting with 
independent Exponential 1/n) r.v.'s (Lij) we can construct the conditioned lengths (Lij) as 

Lij = Lij + b{i) + b{j), {i,j) i Nr{vL, vr). (18) 
This provides a coupling of the unconditioned and conditioned lengths. 

Now consider the percolation process started at vertex 1, and assume 1 ^ A. Let Tl be the first 
time (in the conditioned model) that the percolating water gets to some point on an edge at 
distance r from vl without passing along the distinguished edge. Define Tr similarly and then 
set T = min(Ti, Tr). So at time T the water is at distance b{H) from some random vertex H of 
A. See Figure [2j 

Lemma 8 (a) The (conditioned) distribution ofT is the same as the (unconditioned) distribution 
ofTA, the first time in percolation on Qn that some vertex in A is wetted. And the random vertex 
H is distributed uniformly on A, independent of T. 

(b) N(T) is distributed as the smallest of^^A uniform random samples without replacement from 
{2,3,...,n}. 

(c) Let T' denote the second time that the percolating water gets to within distance t of either 
Vl or Vr along some path which has not previously hit t^- Then (N(T), N(T')) has the joint 
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vertex 1 
O 



Flow leaving R(t) 



Flow from vertex 1 entering L(t} 





edge e 



R(t)( vertices which are within r from v^}) 

L(t) ( vertices which are within r from Vfi) 

Figure 2: Flow from 1 passing through the neighborhood. Flow enters at time Ti along 
some edge {H' , H), and exits at time + 2t + £{e). 



distribution of the smallest and the second smallest of i^A uniform random samples without 
replacement from {2, 3, . . . , n}. 



Proof. Use (18) to construct the conditioned process from the unconditioned process. In the 
unconditioned process it is clear by symmetry that H, the first vertex of A wetted, is uniform on 
A and independent of T^. Obviously H is reached along some edge {H\H) with H' ^ A. In the 
conditioned process, at time Ta the percolating water has reached distance b{H) = b{H) + b{H') 
from and hence is distance r from either vl or vr (whichever is closer to H). So T = Ta in 
the coupling. This gives (a). Parts (b) and (c) are similar. ■ 

The next lemma formalizes the idea "what do we know about edge-lengths at time T^?" As 
described above, on {Ti < Tr} there is some vertex H £ L{tA) such that Th — Tl = b{H), and 
vertex H gets wetted via some edge {H' , H) where H' is in the set VV(Ti) of vertices wetted by 
time Tl. Arguing as in Lemma [7] shows 

Lemma 9 Conditional on Mr{vL,vji) = and conditional on a^J^Ti^, H' , H), on the event 
{Ti < Tr}, the collection of edge-lengths (Lij) as {i,j) runs over all edges except 

(i) an edge of Nr{vL,VR) 

(ii) ijGWin) 
(Hi) {i,j) = {H',H) 

are independent with distributions 

Uj=Uj+b{i) + b{j), {i,j)^W{TL) 
Lij = Li, + {TL-Ti) + b{j), ieWin). 
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Recall we are conditioning on Mr{vL,VR) = t^. Write 

a := 2r + £(e) 

where e is the distinguished edge e = {vL,vji) of tA_- Estimating the mean flow through e is 
tantamount to estimating the mean of the random variable 

Mi:=#{2<j<n: e E 7r(l, j)} (19) 

counting the number of vertices j G [n] with j 7^ 1 such that the shortest path from 1 to j passes 
through e. To analyze Mi, we consider the set 7^*(t^) of vertices in -R(t^) which are flrst reached 
via edge e: 

n*{tA) :={veR{tA):ee7v{l,v)}. 
Note that from the deflnitions of Tj^ and 

if Tr < Tl then #7^*(tyl) = and Mi = 
if 7l < Tr < + cT then < #7^* (t a) < #i?(t a) 
ifTR>TL + a then #71* {t a) = #R{tA)- 



Lemma 10 E,(Mi| = • 
Proof. In the notation of Lemma |2] 

Mi= ^ (20) 

Applying Lemma [2] with W = Tl + a (the fact we are working conditional on Nt{vL)Vr) = 
doesn't affect the martingale property after time W) gives the second equality below: 



N{W) ■ 



2.6 The conditioned mean flow 

We now start studying n ^ 00 asymptotics. Recall the deflnition ([6]) of the normalized flow 
Fn{e) across an edge e of ^n- The next result calculates the expected flow conditional on the 
neighborhood structure of Qn around e. 

Proposition 11 Fix t and t £ Tj- and write a = 2t + l{e) where e is the distinguished edge e 
oft. Let 'u£ / f ^ G [n] and let ^ An C [n] satisfy #An = #t. Then as n ^ 00, setting 

en = {vl,v'lQ, 

MK{en)) = (l + o(l))#L(t)#i?(t)e-'^logn. 
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Here !£,-(•) denotes E( •|A/'r(w£, fj^) = iA„)- All quantities except t,T,a depend on n, though the 
dependence is often not made explicit in notation in the proof below. 



Proof. Assume vertex 1 ^ A. Recall the definition of Mi from (19). We shall show that 



E,(Mi) = (l + o(l))#L(t)#i?(t)e-'^logn. (21) 

In terms of flows of volume 1 /n between each vertex-pair, the (conditional) mean contribution 
to the flow Fn(e) through the distinguished edge arising from flow started at vertex 1 ^ A equals 
n~^E(Mi). The same contribution arises from each of the n — ^t possible starting vertices v ^ A. 



For V A the flow through e is trivially bounded by 1. So to prove Proposition 11 it is enough 



to prove (21 ). 



Fix a sequence ujn — > oo, with uJn = o(logn). Start the first passage percolation process from 
vertex 1. Recall that denotes the first time the flow is within distance r from vl, and that by 
time Ti + a the flow has wetted every vertex in t^. We shall show that the dominant contribution 
to Er(Mi) is from the "good" event 

G* := {iUn < Nnin) < n/iOn} H {Tn >Tl + a}. (22) 



For the details, observe that we can apply Lemma |5] to the percolation process on edges excluding 
the edges of and deduce 

J-^Tl^ 1 uniformly on {uJn < N{Tl) < n/ujn} (23) 

where N^^^i^^t) is the number of vertices wetted by time t via paths which use no edge of t^^. For 
the rest of the argument we work on the event {Tl < Tr} (otherwise. Mi = 0). Recall (Figure 2) 
that flow enters the neighborhood at time Tl along some edge (H' ,H). We claim: conditional on 
a{J^TL, H, H'), on the event {Tl < Tr}, the expected number of vertices of i?(t^) wetted before 
time Tx, + (7 by paths not using the distinguished edge is at most 

{N{Tl) + #L(t))#i?(t)eV(n - 1). (24) 



V " e-N{TL) - 



This follows from Lemmas |9] and [1] applied to W{Tl) U L{tA) and i?(t^), because the former 
lemma implies that the conditioned edge lengths can only be longer than the unconditioned edge- 



lengths in the latter lemma. Note that the expectation (24) tends to on {N(Tl) < n/iVn}, so 
that 

¥,{Tii >Tl + a\TTj^ ^ 1 uniformly on {N^Tl) < n/u^n] D {Tl < Tr}. (25) 

Next let A^via(i) denote the number of vertices outside tA which have been wetted by time Tl + ct 
using some path via tA- Again using Lemmas [o] and [6| applied now to L{tA) and [n] \ A , we find 

E,(l{Ti < Tn}N^UTL + ct)|^tJ < #^(t)e^ (26) 

^From the definitions, 

< N{Tl + a) - A^_m(Tl + a)< N^^Tl + (t) + #t. 
Combining this with (23j 25 26) we deduce 

J^TlJ 1 uniformly on {a;„ < N{Tl) < n/un] n {Tl < Tr}. 

(27) 
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Recalling the definition (22) of G*, Lemma 10 implies 



Er[Mil{G*)] 

n #R{t) 



1{G*) 
N{Tl + a) 



(28) 



To obtain asymptotics for the right side, use the Lemma [8|^b) description of the distribution of 
N(Tl) to conclude that 

{Tl < Tr, Nnin) = m) = (1 + 0(1))^^ uniformly on {2 < m < n/c^„} . (29) 

n — 1 

The harmonic sum estimate YlZi^" = + o(l))logn leads to 
1 



l{0Jn < N{Tl) < n/LOn}l{TL + <T < Tr} 



(1 + o(l))#L(t)e-^n-^ logn. 



Combine this with (27) to get 



1{G* 



N{Tl + a) 



(1 + o(l))#L(t)e"'"n"Mogn 



and thus by (28) 



E,(Mil(G*)) = (l + o(l))#L(t)#i?(t)e-'^logn. 
Recalling that Mi = on {Tr < T^}, we can write 

Er[Ml] = Er[Mll{G*)] +Er[Mll{Bl)] +Er[Mll{B2)] +Er[Mll{B3)] 

for the "bad" events 

Bi := {Tl < Tr} n {N{Tl) > n/uj^} 

B2 := {Tl < Tr} n {N{Tl) < u;„} 

B3 := {Tl <TR<TL + a}n {ujn < N{Tl) < n/u^n} 

and we need to check for each B that ET-[Mil(i3)] = o(logn). In each case we start by using 
Lemma [TOl 

E,[Mil(i?i)] = nEr ( ,f,^*^^^\ l{TL < Tr}1{N{Tl) > n/un} 
< #R{t)un = o{logn) 
because we chose tOn = o(log n) . 

E,[Mil(i?2)] = nEr ( ^^[^I'^I^ HTl < Tr}1{N{Tl) < ojn}^ 



< n#R{t)Fr {Tl < Tr, Nnin) < a;„) by (|29 
= (l + o(l))#i?(t)#L(t)^„ 
= o(logn). 



For the third event, 



Er[Mil{B3)] = nEr (^tP^HTl < Tr < Tl + a}l{ujn < N^in) < n/un}) • 
\N{Tl + (j) J 
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By (24) we have 



FriTR<TL + a\ ^tJ < {Nin) + #L(t))#i?(t)e'^/(n - 1). 

Because N{Tl) < N{Tl + a) and #7^*(tA) < #i?(t), writing C = {Tl < Tr} n {a;„ < A^(Tl) < 
n/ujn} we have 



[Ml 1(^3)] < n#ii(t)E^ 



(iV(ri) + #L(t))#i?(t)e'^ 1 



n — 1 



'N{Tl 



-1(C) 



< (l + o(l))(#i?(t) + l)2#L(t)e^ 



This completes the proof. 



We record a minor rephrasing of Proposition 1 1 



Corollary 12 In the setting of Proposition 11, suppose vertices 1,2,0 An. Then 
P,(e„ € 7r(l, 2)) = (1 + o(l))#L(t)#i2(t) 



logn _^ 

e 

n 



Proof. By symmetry over vertices j An U {1} we have PrCcn £ 7r(l, 2)) = where 

is defined as Mi but excluding vertices j G An- Since Mi — M* < the corollary follows from 



(21). 



2.7 Conditional variance of the flow 



In the setting of Proposition 11 we want to show that the flow Fn{e) is close to its conditional 



expectation, and the natural way to express this is via the conditional variance. 



Proposition 13 In the setting of Proposition 11. 

WS.Ir{Fn{en))Wr{vl,vl)=tA^) . 1 1 



"^riFnienWrivl.V^, 



tA 



< 



+ 



+ 



1 



#L(t) #R{t) #L(t)#i?(t) 



+ o(l) as n 



oo. 



This formulation emphasizes that the relative variance of the conditional distribution gets smaller 



as the size of the neighborhood gets bigger. We remark that Proposition 11 alone (i.e. without 
Proposition 13) is enough to prove the "expectation" asssertion ^ of Theorem [T| Proposition 
13 is needed for the L^-convergence assertion ([7|. 



The key step in proving Proposition 13 is the following Proposition. To set up notation, we may 
assume the edge Sn is (n — 1, n) and the label set An is {n — #V{t) + 1, . . . n}. Recall 7r(l, 2) 
denotes the shortest path from 1 to 2. If this path uses then there is some entrance-exit pair 
(a, /?) G -^^(t) X R{t) recording the first and last vertices of the neighborhood visited by the path 
(here we identify vertices of t and tA„ ) • 



Proposition 14 Let (a,/3) and (7,5) be pairs in L(t) x i?(t). As n ^ oo 

PT-(7r(l,2) contains Cn with entrance-exit pair {a, (3); 7r(3,4) contains Cn with entrance-exit pair (7,(5)) 
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<(l + o(l)) 



logn^-cr 



where 



(30) 



Proposition 14 (more precisely, the variant Proposition 18 described in section 2.10 ) will be proved 
in section |3j Intuitively we have "= (1 + o(l))" instead of "< (1 + o(l))", but the fact that we 
need only prove an inequality is technically helpful. Also intuitively, the constant k arises as 

where Wa is the limit Exponential 1) r.v. arising (cf. Lemma[4| in the growth of the percolation 
process from source a using flows avoiding any other vertex of the neighborhood. 



Proof of Proposition |13[ The sum of Kq:,/?,^,^ over all choices of a, /3, 7, 5 works out as 
#L(t)#i?(t)(#L(t) + l)(#fl(t) + 1). So 



P^(7r(l,2) contains e„ ; 7r(3,4) contains e„ ) < (l+o(l))#L(t)#i?(t)(#L(t)+l)(#i?(t)+l) 
Using Corollary [12] we get a covariance bound 

PT-(7r(l,2) contains e„ ; 7r(3,4) contains e„ ) — P^(7r(l,2) contains Cn ) 



< (1 + o(l))#L(t)#i?(t)(#L(t) + #i?(t) + 1) 



(31) 



The contribution to Fn{en) from source-destination pairs where i or j is in t is negligible, 
so we may replace F„(e„) by 

Gn{en) := ^^l{en G7r(i,i)} 

{id) 

where here and below the sum is over ordered pairs of vertices in [n] \ An. Writing e = e„, 

var^^l{e G 7v{i,j)} < ^ l{e G 7r(i,j)} 
(hJ) {hi) 

+ [PT-(7r(i,j) contains e ; n{i',j') contains e ) — P^(7r(z,j) contains e )] . 

{i,j) {i',j')f^{^,j) 

Using symmetry and a compatibility condition (a directed edge e cannot be in the shortest path 
from i to j and also in the shortest path from j to k) we find 

var^G„(e) < n^^ET-G„(e) + 2nPT-(7r(l, 2) contains e ; 7r(l,3) contains e ) 

[PT-(7r(l,2) contains e ; 7r(3,4) contains e ) — P^(7r(l,2) contains e )] . 



The first term is 0{n ^ log n) by Proposition 11 Bounding the second term crudely by 2nPT-(7r(l, 2) contains e ) 



term, which by (31 ) shows 



and using Corollary 12 shows the second term is O(logn). So the dominant term is the third 



var,G„(e) < (1 + o(l))#L(t)#fi(t)(#L(t) + #R{t) + 1) [logn e"'^] 



Combining with Proposition 11 we have established Proposition 13 
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2.8 WLLN for a local functional 



The point of Propositions 11 and 13 is that the normahzed flow Fn{e) / logn, which a priori 
involves the global structure of Gn, can be approximated by a certain functional, (pn{e) below, 
which depends only on the "local" structure of Qn near e. It is a general fact that empirical 
(random) distributions of such "local functionals" on Qn converge to the limit non-random dis- 



tribution associated with the Yule processs/PWIT mentioned in section 2.3 Rather than prove 



a general result in this context (for the general result in a different context see [2^ Proposition 7) 



we will just derive the specific result we need. Proposition 15 



Fix r > 0. Recall from section 2.4 the definition of the neighborhood A/'r(e) of a directed edge e 



of Qn- For each directed edge e define 

</,;(e) = #L(A/;(e))#i?(A/;(e)) exp(-2r - Le)l{Le < r}l{ A/;(e) is a tree}. (32) 

Define as the empirical measure on [0, cxd)^ obtained by putting weight 1/n on each point 
(Le, (pni^)) associated with the edges e of Qn for which Lg < r: 



directed 6 

and define the mean measure 

$;(•,•) =E$;(.,.). 

Define a limit measure 



where W{ and VFJ are independent Geometric(e~'^). So has total mass r. 

Proposition 15 For any continuous test function h : [0,oo)^ M with compact support, 

hd^l^ I h d^L in L^. 



The proof rests upon the following straightforward lemma. Although superficially similar to 
Proposition 14 in using vertices 1,2,3,4 as typical vertices, their role here is different. The 
precise statement is a bit fussy because the neighborhood must be a tree in order for left and 
right sides to be well-defined. 



Lemma 16 Fix r > and the directed edges (1,2) and (3,4). 

(a) nP(Li2 < T,Li2 G •) converges vaguely to Lebesgue measure on [0,t]. 

(b) Uniformly on {Lu < r} 

P(7V;(1,2) is a tree\Li2) 1 (33) 

as n ^ CO. The same holds for edges (1,2) and (3,4); that is, uniformly on the set {L12 < 
r} n {L34 < r} 

P({7V;(1, 2) is a tree] n {A/;(3,4) is a tree}\Lu, ^34) ^ 1- (34) 



(c) Let < ^12,^34 < T. Write {Nn{l), Nn{2)) for the numbers of vertices in the left and 

16 



right sides of the neighborhood A/'t(1,2); define {N^(3),NI^{4)) similarly for the neighborhood 
A/'r(3,4) (these are well-defined when the neighborhoods are trees). Conditional on the event 
{Mt{1, 2) is a tree} n {Mt{3, 4) is a tree} n {L12 = ^12, -Z>34 = ^34} we have 

(7V„-(l),iV^(2),iV;(3),iV;(4)) ^ {W[,W^,W^,Wl) 
where the W 's are independent Geometric(e~'^ ). 



Proof of Proposition 15 The mean measure equals 



a2T 



L\2 ^ . 



Because nP(Li2 < r, L12 € •) converges vaguely to Lebesgue measure on [0,r], Lemma 16 (here 
only vertices 1 and 2 are relevant) implies vague convergence — > of mean measures. To 
get convergence it is enough to show that for a generic test function h we have 



var 



h d^n 



var 



e:L„<7 



Expanding the right side as the variance-covariance sum, the contribution to variance from 
terms (e, e') with 4 distinct end-vertices tends to by Lemma 16 and the fact that n^P(Li2 < 



T,A/'r(l,2) is a tree,Li2 S -,-^^34 < T,A/'r(3,4) is a tree ,^34 S •) converges vaguely to Lebesgue 
measure on [0,r]^. The contribution from terms with 3 distinct end- vertices is bounded by 



1 



n 



EA?(r) ^ 



where Ai(t) is the number of edges at 1 with length less than r. And the contribution from pairs 
(e,e) is bounded by ||/i||^P(Li2 < r) ^ 0. 



2.9 Completing the proof of Theorem [T] 



The remainder of the proof uses only "soft" arguments. 



Propositions 11 and 13 were stated for fixed t E T,-, but it is clear that convergence is uniform 
over subsets of T,- on which the length of distinguished edge is bounded and the number of 
vertices is bounded. Rephrasing those Propositions gives, after some obvious manipulations: 



Corollary 17 Fix r > and K < cxd. As n ^ co 

Fn{en) 



E 



var 



log n 

Fn{en) 

logn 



A/;(e„) ) < 3X^6"^ + o(l) 



uniformly 0V6T €fi satisfying 

Nr{en) is a tree ; Le„ < r; uiayi{ij^L{J\fr{en)),#R{Mr{en))) < Ke^. 
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Now fix e > 0. Applying Chebyshev's inequality (conditional on Nr{&n)) and taking limits, 
limsupEi V l{Le < TMr{e) is a tree}l{max(#L(AA,(e)), #i?(AA,(e))) < i^e"}l{|^-0;(e)| 

n ,. , 

directed 6 

And using Lemma [TB] 

limsupEi V l{Le < T}l{7V;(e) is a tree }l{max(#L(A/;(e)), #i2(7V;(e))) > Ke''} 

directed G. 

= limnEl{Li2 < t}1{A/;(1, 2)is a tree }l{max(#L(AA^(l, 2)), #ii(AA^(l, 2))) > Ke^} 

n 

= T¥{uia^{Wl, W^) > Ke^) 
< 2Texp{-K + e^^). 

Combining these bounds, 

limsupE^ Yj HLe < T}l{Afr{e) is a tree }l{|^-<^;(e)| > e} < 3£-^K^e-^+2Texp{-K+e-^ 

^ directed 6 

Apply this with K = t and then let r — > oo: 

limlimsupE^ V l{Le < r}l{AA,(e) is a tree }1{| ^ - </);(e)| > e} = 0. (35) 

directed 6 



Also by Lemma 16 for each fixed r 

limsupEi l{Le < T,Afr{e) not a tree} = (36) 



We now want to be a little fussy about the underlying space for our bivariate measures, which 
we will take to be [0,oo) x (0,cxd) (recall the first coordinate is length, the second is flow). 
This means that the cr-finite limit measure ip arising in the statement of Theorem [T] is finite on 
compact subsets. Recall that vague convergence f„ ^ of measures on [0, oo) x (0, oo) means 
/ hdh'n — *■ / hdu for bounded continuous test functions h : [0, oo) x (0, oo) M with compact 
support, and that in checking vague convergence we need consider only test functions with finite 
Lipschitz norm ||/i||Lip- A random measure can be viewed as a random variable taking values 
in the space of measures equipped with the vague topology, and so it makes sense to consider 
convergence in probability 

V'n — *■ '0 in probability (37) 

for the random measures V'n appearing in Theorem [T] and this is what we shall prove. Of course 
it suffices to prove that for test functions h we have convergence in probability for the M-valued 
random variables J hdip^ and we shall prove the stronger result 



j hdijj in L\ (38) 



To prove this, recall the definitions 

M-r) = k J2 l{(ie, Fn(e)/ log n) E (•,.)} 



directed 6 
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directed G 

Fix h with support contained in [0,tq] x [0, oo). Then for t > tq 



hdipn 



< 2\\h\\^l Yl l{^e<ro,AA,(e)isatree}l{|^-(^;(e)|>e} 

directed € 

+ 2||/i||oo^ Yl l{^e <ro,AA,(e) not atree } + e||/i||Lip^ Yl HLe < 

directed 6 

Because Xldircctcd e^i-^e ^ ^o} To we can use ( 35|36 ) to deduce 



directed € 



lim lim sup E 



Because e is arbitrary, this shows 



< ^ll^llLipTO- 



Hm hm sup E 



hdil^n - / hd^ 



0. 



Proposition 15 allows us to replace the random measure by the limit mean measure 



lim lim sup E 



hdipn 



hd<^L 



But — > vaguely as r — > cxd, and so we have proved (38) and thence (37), which is our 
formalization of the final assertion of Theorem [TJ 



To prove the other assertion ([s]) of Theorem [T| recall that the fact (13) EZ)„ ~ logn becomes, 

E [ ly Mdi,dy) ^ [ iy i>{di,dy) = 1. 



via 



continuous h > satisfying sup^,^ '^^ey^ < Using such functions to approximate the function 
1{£ > e,y > z} shows 

i#{e : Le > e,Fn{e) > zlogn} -^^i 'il^{{e,oo) x {z,oo)). 

Because E^^^je : Lg < e} — > e we can let e — > and deduce 

^#{e : Fnie) > zlogn} i>{{0,oo) x (z,oo)) 

which is the first assertion of Theorem [TJ 



This enables us to extend the convergence (38) from continuous h with compact support to 



2.10 Distance based truncation of flows 



To avoid notational complications, the exposition above omitted one technical point. Recall that 



path-lengths D{i,j) are logn it 0(1) in probability. In seeking to prove Proposition 14 there are 
technical difficulties with unusually long paths, which we will handle by truncating them out. 



Precisely, instead of proving Proposition 14 we will prove 
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Proposition 18 Fix B < oo. Let {a, (3) and (7,5) he pairs in L{t) x i?(t). Conditionally on 
Mrin — 1, n) = tyi^, as n ^ oo 

P(7r(l,2) contains e„, with entrance- exit pair (a,/3); len(7r(l,2)) < \ogn + B; 

7r(3,4) contains e„ with entrance- exit pair (7,5), len(7r(3,4)) < \ogn + B) 

<(l + o(l)) 



logn^-cr 



where 



l^a, (3,^,5 — 2 



l{7=a}+l{<5=/3} 



(39) 



In this section we explain (omitting some details at the end) why it is enough to prove Proposition 



18 in place of Proposition 14 Consider the analog of flow Fn{e) when contributions from paths 
of length > log n + B are ignored: 

Pn\^)-=IY. E l{eG7r(i,j)}l{len(7r(i,i)) <logn + i?}<F„(e). 

i6[n.] ie[ra] jVi 

An easy argument shows that for large B the global effect of truncation is negligible: 



Lemma 19 



lim limsupE- > L, 



'■ log n 



Proof. Recall D{i,j) = len(7r(i, j)). Calculating the effect of truncation on edge-flows and on 
source-destination distances gives the identity 

^L,{Fn{e)-Fi^^{e)) = ^ ^Z)(i,i)l{Z)(i,i) > logn + B}. 

Taking expectations and using symmetry 

j;Le(F„(e)-Ff](e)) = ^EZ)(l,2)l{I)(l,2) >logn + i?}. 



The result now follows from the mean and variance limits at (12). 



Now we can choose Bn T cxd sufficiently slowly that (by Lemma 19 ) 

n / J logn 
e 

and such that 



the conclusion of Proposition 18 holds for B^ in place of B. 



(40) 



The idea is now to repeat the arguments in sections 2.7 - 2.9 using the truncated flow Fn {&n) in 
place of Fn{en)- This will establish Theorem [T] for the truncated flows, but then (40) establishes 
it for untruncated flows. The arguments would go through unchanged if we knew the truncated 
version of the conditional mean estimate of Proposition [Tl] 

E,(4^"l(e„)) = (l + o(l))#L(t)#fl(t)e-'^logn. (41) 
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Of course the conditional upper bound "< (1 



0(1))" in (41) follows from Proposition 11, but 



we need the lower bound "> (1 + o(l))" in (41) in order to go from the upper bound on second 



moment to the upper bound on variance - cf. (31). However, from the conditional upper bound 



and because (40 ) implies a lower bound for unconditional expectation, Markov's inequality implies 



that the conditional lower bound in (41 ) holds for all neighborhoods Nrin — 1, n) excluding some 
occuring with probability — > as n — > 00. And this is enough to complete the proof of Theorem 

m 



3 The variance estimate 



This section is devoted to the proof of Proposition 18, which will complete the proof of Theorem 
[1] Let us repeat the "conditioned" setting that we work in, throughout the section. There is a 
fixed tree t with distinguished directed edge e. In the network Qn we fix edge e„ = (n — l,n) 
and label set An = {n — + 1, . . . , n}. Label t as tA„ so that e is labeled (n — 1, n). Then 
condition on Nr{n — 1, n) = t^n- Recall that Lemma [7] tells us the effect of this conditioning. In 
particular, for i j G the length of the edge-segment (i, j) from i to the boundary of the 
neighborhood has Exponential 1/n) distribution, independently for different edges. 



Roughly speaking, the issue in proving Proposition 18 is to estimate the dependence between the 
events 

(i) the shortest path 7r(l, 2) between vertex 1 and vertex 2 uses edge e„ 

(ii) the shortest path 7r(3,4) between vertex 3 and vertex 4 uses edge e^. 

Corollary [12] tells us the asymptotic probabilities of these events, so a natural approach is to 
condition on (i) and seek to calculate the conditional probability of (ii) . Now (i) breaks into two 
assertions: 

(ia) there is a short path (length s = logn it 0(1)) from 1 to 2 via e; 

(ib) there is no shorter path from 1 to 2. 

Now conditioning on (ia) can be implemented by conditioning on all edges in the path, and this 
doesn't affect lengths of other edges of Qn- But event (ib) implicitly specifies that alternate short 
routes do not exist, and the effect of this conditioning on other edge-lengths of (while intuitively 
small) seems hard to handle rigorously. Instead, we shall carefully organize an argument to avoid 
ever conditioning on any "shortest path" event. In outline, the argument has three steps. 



Calculate chance of existence of paths 7ri2, vr34 of specified lengths through e„ (section 3.1 ) 



Conditional on existence of such paths, what is the chance they are the shortest paths? The 
percolation processes from vertice s i = 1,2,3,4 avoiding edge e„ become approximately 



size-biased Yule processes (section 3.4) reaching VFjC* vertices in distance t; so the chance 
of a path from 1 to 2 of length t avoiding is approximately exp{—WiW2e^) (section 3.5). 



These two estimates are combined in section 13^ 
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3.1 Joint intensity for two short paths through e 



For this section we introduce some handy notation. We will describe the relationship (for an 
event B, a random variable T, and a function /) 

F{B, T G {t,t + dt)) = f{t)dt 

by the phrase 

the event [[B, T = t]] has intensity f{t). 

But we will describe events in words, rather than inventing ad hoc symbols, using the brackets 
[[ ]] to highlight the verbal description of the event. 

For a G L(t) and (3 G R{t) define ha,i3{si, S2) to be the intensity (in si and S2) of the event: 



[[there exists a path from 1 to vl which crosses the neighborhood boundary at time 
si and then first hits vertex a; 

and there exists a path from 2 to w/j which crosses the neighborhood boundary at 
time S2 and then first hits vertex /?.]] 



Note that such paths, linked via the path from q to /3 in the neighborhood, specify a path 7ri2 of 
length si + S2 + cr from 1 to 2 via e. This path may or may not be the shortest path from 1 to 
2, depending on lengths of other edges in Qn- 

Given also 7 G ^(t) and S G R{t), define an event which replicas the event above: 



[[there exists a path from 3 to vl which crosses the neighborhood boundary at time 
ti and then first hits vertex 7; 

and there exists a path from 4 to vji which crosses the neighborhood boundary at 
time t2 and then first hits vertex 5.]] 



Again, such paths specify a path tt^^ from 3 to 4 via e. In order for it to be possible that both 
7ri2 and 7r34 are shortest paths, the following simple compatibility conditions must hold. 

(i) if the paths from 1 and from 3 meet at some vertex outside the neighborhood, then they 
must coincide from to the neighborhood. 

(ii) if the paths from 2 and from 4 meet at some vertex v* outside the neighborhood, then they 
must coincide from v* to the neighborhood. 

(iii) the set of vertices visited by the paths from 1 and 3 must be disjoint from the set of vertices 
visited by the paths from 2 and 4. 

Define Ha^/3^-y^s{si, S2,ti,t2) to be the intensity of both events happening and the compatibility 
conditions holding. 



Lemma 20 



ha,(5{si,S2) < ^exp(si + S2) (42) 

Ha,f3,-y,5{si,S2,ti,t2) < exp(si + §2 + + ^2) • (43) 



Here k = Ka,f3,-y,5 as at (30) 
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Note these are inequalities for finite n. Heuristically they are asymptotic equalities in the ranges 
of interest to us. 

Proof. The argument is based on exact formulas, starting with the following. The intensity of 
the event 



[[there exists a path from 1 to vl which crosses the neighborhood boundary at time 
si and then first hits vertex a, taking exactly k + 1 steps]] 



(n - 1 - #t)fc 4 , 
k\ 



^ exp(^^) 



(44) 



where we recall that h{a) is the distance from a to the neighborhood boundary. To prove this, 
take < til < n2 < . . . < Ufc < si and consider the probability that the j'th step ends at distance 
[uj,Uj + duj] from 1 and the boundary crossing is at distance [si,si + dsi]. This probability is 

k 

(n - 1 - #t)fc X i exp(-('Uj - Uj-i)/n)duj x i exp(-(si + 6(a) - Uk)/n)dsi (45) 
i=i 

where the first term indicates number of choices of k intermediate vertices, and the other terms 
are the Exponential(mean n) densities of edge-lengths. Because 



/•••/, 



0<u\<U2<...<Uk<s 



^ dui . . . duk = s\/k\ 



(46) 



we deduce (44). 



The first and last terms of (44) are < 1. Summing over k shows that the intensity of 



[[there exists a path from \ to which crosses the neighborhood boundary at time 
si and then first hits vertex a]] 



is < ^ exp(si). Combining this with the similar argument on the right side of the neighborhood 
gives ( 42 ) . 



To prove (43), because the left and right sides have analogous arguments, the issue is to show 
that the intensity of the event 



[[there exists a path from 1 to vl which crosses the neighborhood boundary at time 
si and then first hits vertex a; 

there exists a path from 2> to which crosses the neighborhood boundary at time ti 
and then first hits vertex 7]] 

<2i(°=7)J_exp(si+fi). (47) 

Now in the case a 7^ 7, or for the contribution to the case a = 7 from disjoint paths, we get 
density < ^exp(si + ti) by arguments analogous to above. Let us show details of the more 
interesting case where a = 7 and we consider the contribution from merging paths. Consider the 
intensity (in r, si,ti) of the event: 
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[[there exists a path from 1 to vl which crosses the neighborhood boundary at time 
si and then first hits vertex a, taking exactly fci + ^2 + 1 steps; 
there exists a path from 3 to vl which crosses the neighborhood boundary at time ti 
and then first hits vertex a, taking exactly ^3 + ^2 + 1 steps; 

these paths merge at some vertex v^, at distance r from the neighborhood boundary, 
the path from to a using /c2 + 1 steps.]] 



(n - 1 - #t)k,+k2+ks-i {si - r)^i {ti - r)^3 r^^ ^ ^ -sr-t^+r-b{a) 

exp\^ 



Analogous to (44) this intensity has an exact formula 

- r)'=i {h - r] 
ki\ k^l /C2! 

1 (si -rf^{ti -rfir^^ 

Summing over (fci, A;2, /cs) shows that the intensity of 



This intensity is bounded by 



[[there exist paths from 1 (resp. 3) to vl which cross the neighborhood boundary at 
time si (resp. ti) and then first hit vertex a, having merged at distance r before the 
boundary]] 



is bounded by ^expfsi +ti — r). Integrating over r shows that the contribution to (47) from 

^ 1 I I I 

merging paths is < exp(si + This establishes (47). 



3.2 A Cox point process 



Here we introduce a process and a lemma; how the process arises will be seen in section 3.5 



Take independent random variables Wi,W2 with probability density ■we~^ on < w < 00. 
Consider the Cox point process defined by: conditional on Wi,W2 the points form a Poisson 
process of rate {WiW2e'^, —00 < s < 00). Let L be the position of the leftmost point of this Cox 
process. 

Lemma 21 e^P(L > s) ds = l. 



Proof. Note P(L > ^^2) = exp(-VFiW^2e*) and so 

e"P(L>s)ds = / e'Eexp{-WiW2e') ds 

D 

) 

E exp{—WiW2u) du setting u = 



—00 
00 



E 



1 

W1W2 



E. 



1 



1. 
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3.3 Conditional distributions of other short routes 



Recall we are conditioning on Mrin — 1, n) = t^^, though this is not indicated in notation. 

Fix times si,S2,ii,t2 and vertices (maybe the same) a,j £ L(tA„) and (3,6 £ i?(t^„). Recall 
from section 3.1 that -f^a,/3,7,5('5i, S2,ti,t2) denotes the intensity of the event 



[[there exists a path 7ri2 from 1 to 2 via e„, where the path from 1 crosses the boundary 
of the neighborhood at time si and then first hits vertex a, while the reverse path 
from 2 crosses the boundary of the neighborhood at time S2 and then first hits vertex 
P; similarly there exists a path 1134 from 3 to 4 via €yi, where the path from 3 crosses 
the boundary of the neighborhood at time ti and then first hits vertex 7, while the 
reverse path from 4 crosses the boundary of the neighborhood at time t2 and then 
first hits vertex 6 ]] 

together with certain compatability conditions. We will write ( • |si, S2i ^i, ^2) to denote condi- 
tioning on this event. For such paths we have 

len(7ri2) = 51+52+0", len(7r34) = ti + t2 + a, 

and we write len(7ri2) and len(7r34) for these sums. 

Write 512 (resp. 534) for the length of the shortest path from 1 to 2 (resp. from 3 to 4) that does 
not use edge e. Let us first show that Proposition |18| reduces to the following proposition. 

Proposition 22 Fix B < 00. Uniformly on {max(len(7ri2), len(7r34)) < logn + S}, as n —> 00 
¥{Si2 > len(7ri2), ^34 > len(^34)| Si,52,ti,t2) < 1P(CP > len(7ri2)-log n, ^f"^ > len(^34)-logn)+o(l) 
where (^i^,'^2^, • ■ •) and {Ci'^,^2'^, ■ ■ ■) are independent Cox point processes as described in section 

El 



We will prove this in sections 3.4 - 3.5 but let us first show how to deduce Proposition 18 from 



Lemma 20 and Proposition 22 



Proof of Proposition 18, A path 7ri2 via Cn using entrance-exit pair (a,/?) with len(7ri2) 



si + fj + S2 is created as in the definition of /iq,/3(si, S2) from two paths with lengths-to-boundary 



si and 82- Create 7r34 similarly, using paths of lengths ti and t2. The quantity in Proposition 18 
P(7r(l,2) contains with entrance-exit pair len(7r(l,2)) < logn + 5; 

7r(3,4) contains Cn with entrance-exit pair (7,(5), len(7r(3,4)) < \ogn + B) 



can be calculated in terms of the intensity H aX ( 43 ) as 



l{si + S2 + o- < log n + B]l{ti +t2 + a <\ogn + B] (48) 
P(7r(l,2) = 7ri2,7r(3,4) = 7r34|si, S2, ti, t2) -ffa,/3,7,<5(si, ^2, ^2) dsids2dtidt2. 



Now 



(7r(l,2) = 7ri2,7r(3,4) = 7r34|si, S2, ^i, ^2) < IP(5'i2 > len(7ri2), S'34 > len(7r34)| 51,52,^1,^2), 
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this being an inequahty because there might be shorter paths using e. Bounding the right side 
using Proposition [22] gives 

P(7r(l, 2) = ^12, 7r(3, 4) = TTsilsi, S2,h,t2) < P(eP > s') > t^) + en,B 

where hm„ en,B = and where 

s""^ = si + ,52 + cr — log n; = ti + t2 + a — log n. 



To upper bound (48), first fix and and calculate 

Ha,f^,^,s{si,S2, ti,t2)dsidti 



< 



exp(si + S2 + ti + t2)dsidti by Lemma 20 

= ^ exp(s^ + - 2a) x (log n + - cr)(log n + - a) 
< Ke-^'^ ^'°^lt''^\ xp{s'+t') 
where in the final line we assume max(s^,t^) < B. Thus the quantity (48) is bounded by 



(logn+B)^ 2a 

fx 9 C 



/ / (P(eP > 5l)p(^34 > ^1) ^ g.i^gl /^^l 

J —oo J~oo 



< ^i^^s^+^e-^'^ (1 + e„,Be2^) by Lemma [21] 



establishing Proposition 18 



3.4 Size-biasing the percolation process and Yule process 



This section builds up to proving a result, Proposition 25 about the number of vertices seen by 
the percolation process from vertex 1 when we condition on existence of a short path of specified 
length from vertex 1. 



3.4.1 Some terminology 



Let us quickly revisit the structures (section 2.1) within Qn associated with percolation from 
vertex 1 and introduce more precise terminology. The percolation tree itself is the spanning tree 
consisting of all edges in the shortest paths 7r(l, j), 2 < j < n. The percolation tree process tells 
us at time (time = distance) t the subtree on vertices within distance t from vertex 1. And the 



percolation counting process Nn{t) at (10) tells us at time t the number of vertices within distance 



t from vertex 1. We can use the same terminology for the Yule process of section 2.3 the process 
(Noo{t), < t < oo) is the Yule counting process. The underlying continuous-time branching 
process, run until time t and then regarded as a random tree with edge-legths, is the Yule tree 
process at time t. This process run to time oo is the Yule tree or PWIT, a random infinite tree 
with edge- lengths. 



3.4.2 Heuristics for size-biasing 



Associated with the Yule counting process is the limit (Lemma |3|^b)) random variable W := 
linit e~^Noo{t) with probability density e""^ on < w < oo. What can we say about the Yule 
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tree conditioned on it having a vertex at some specified large distance Iq? The probabihty of this 
event given W is approximately proportional to W, so the posterior density of W given this event 
becomes approximately we~^ . This is a basic instance of size-biasing. But instead of relying on 
Bayes calculations for single random variables, we describe next the more elegant approach to 
size-biasing the whole process based on a probabilistic construction (this type of construction is 
widely used in modern branching process theory [15 ). In this method the density we~^ arises 
as the density of the sum of two independent Exponential 1) random variables. 

3.4.3 The size-biased Yule process 

We are working toward a result of the type 

the n — > oo limit of the size-biased percolation process is the size-biased Yule process 

and now we will define and derive simple properties of the limit process, without justifying the 
"size-biased" name. 

On the half-line put a "root" vertex at the origin and other vertices at the points {Pi,i > 1) 
of a rate 1 Poisson process. Make each of these vertices the root of a Yule tree. Regarding the 
resulting structure as a random infinite tree with edge-lengths, call it the size-biased Yule tree, 
with root at the origin. Given a distance t, the size-biased Yule process at t is the subtree on 
vertices at distance less than t from the root, illustrated in Figure |3| The associated counting 
process, giving the number of vertices at distance less than t from the root, is 

N{t) = No{t) + Ni{t - Pi) (49) 

i:Pi<t 

where (A'^o, A'^i, • • •) are the Yule counting processes associated with the constituent Yule trees. 

Call the original half-line the distinguished path to infinity. We will also use, for technical reasons, 
the variation where the distinguished path is cut at some at some large finite distance s from the 
origin, so its counting process is 

N%t) = NQ{t)+ Yl Niit-Pi). 

i:Pi<min{s,t) 



We collect below some simple facts about these two processes. Our main aim is to understand 
the limiting behavior of N(t) for large t, and similarly, the behavior of N'^{t) for large s and t. 

Lemma 23 (a) There exists a random variable W with probability density we~'^, w > such 
that 

lim e-^N{t) = W; lim lim e"*iV'(t) = W (50) 

t — ^CXD S — >CO t — ^OO 

where the convergence holds a.s. and in L^. 
(b) For any c,s,t>0 

EN'{t) (^c + N'{t) + #{i:t<Pi< s}) < 6e^* + 2e*(c + s). 
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Time Axis 



Figure 3: Size-biased Yule process 



Proof. The construction (49) implies that the size biased process A^(-) can be represented in 
terms of the sum of two independent Yule processes 



N{t) = Noit) + (iV'(t) - 1) 



(51) 



because the contribution from the distinguished path to infinity behaves as another Yule process 
rooted at the origin, with the distinguished path representing the reproduction times of the initial 
ancestor. We subtract 1 to avoid double counting the root. Lemma|3|^b) says we have independent 
Exponential(l) limits (a.s. and in L^) 

Wo := lime"*iVo(t); W = lime~*iV'(t) 



and (50) follows easily. For (b), because N^{t) is independent of #{i : t < Pi < s}, the quantity 



under consideration equals 



(c + (s - t)+)EN'{t) + E{N%t)y' 



Use the inequality N'^{t) < N{t) and the inequalities (from (51) and Lemma [Sf^a) ) 

EiV(t) < 2e*; var N{t) < 2e^* 

to complete the proof of (b). ■ 



3.4.4 The percolation counting process conditioned on existence of a path 



Proposition 25 will formalize the idea 



Conditional on existence of a path from vertex 1 of specified length, the percolation 
process is approximately the size-biased Yule process. 
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In the following lemma, "number of vertices" excludes vertex 1, and we are conditioning on 
length-to-boundary being s. 

Lemma 24 Fix a £ L(t). Condition on the existence of a path of length s + b{a) from vertex 
1 to vertex a. Let denote the number of vertices on this path, and ?7(i) < [/(2) < • • • < f^(ps) 
denote the distances of these vertices from vertex 1. Then 

(a) The exact distribution of P.^ is 

P(P^ = fc)=C(g,n) ^^~^;^^^^ ^, 0<A;<n-l-#t (52) 
where C{s,n) = (^X]j=o ^~~~W^~^fr) normalizing constant. 

(b) Conditional on P^ the (C/(fc)) are distributed as the order statistics of P^ independent Uniform{0, s) 
random variables. 

(c) Suppose Sn — > oo, Sn = o{y/n). Then the variation distance between the distribution of P^" 
and the Poisson(sn) distribution tends to as n ^ oo. 

Proof. Formula ( 45 ) says that the intensity of the event 

[[there exists a path of length s + b{a) from 1 to a whose vertices are at distances 
< ui < . . . < Uj < s ]] 



is of the form c(n, s,a) (n — 1 — #t)jn ^. Now (a) follows from the integral identity (46), and 



(b) follows from the uniformity of the density in (ui, . . . ,Uj). And (c) follows from (a) because 
the ratio ^" tends to 1 for j = o{^/n). m 

For the main result of this section, we study a certain pruned percolation process which we 
now define carefully. Recall that the percolation counting process Nn{t) counts the number of 
vertices j such that there exists a path vr from 1 to j of length < t. For the pruned percolation 
counting process we impose two extra restrictions on vr. First, tt must not use any vertex in the 
neighborhood Mrin — l,n) = tA_„. Next, we will be conditioning on existence of a path, say 
{l,rji,rj2, . . .), of specified length from vertex 1. Say vr contains a short-cut if the path vr meets r]j 
for some j > 1. The second restriction on vr is that vr must not contain any short-cuts. 



Proposition 25 Consider a sequence Sn satisfying ujn ^ Sn < logn -|- B, and a vertex a G L{t). 
Condition on the existence of a path from 1 to a of length Sn + b{a). Let N^"(t) be the pruned 
percolation counting process defined above. Then for each n there exists a random variable Wn 
having density we~^ on M'^ such that 



sup 

aj„<t<logn— a;,, 



'N^-it)-Wr, 



in probability 



as n ^ oo. 



We give the proof in some detail; later (Proposition 27 ) we need the variant for percolation from 



several sources, and we will omit details of that variant. 
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3.4.5 Proof of Proposition 
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We start with some finite error bounds for tlie two size-biased Yule processes N{-) and A^*(-) that 
were introduced in section I3.4.3I 



Lemma 26 Consider any sequences — > oo. 

(a) Fix e > 0. Recall the limiting random variable W from Lemma 

constant C such that 



23 Then there exists a 



sup 

t>w„ 



e~^N{t) - W 



>e] <Ce- 



(h) Fix e > and consider the processes N^"(-). Then there exist random variables Wn having 
density we~'^ on such that 



sup 

t>iUr,. 



Proof. From (51) we see that e * • {N{t) + 1) is a martingale. Part (a) follows from Lemma 23 
and the maximal inequality for martingales. 

To prove part(b), let Y{t) be a Yule process independent of A^*"(-) and define the process 



Znit) = ^ N'-{t) 

= N'"{t) + Y{t-Sr. 



for t < Sn 
1 for t>Sr,. 



Note that the process Zn{-) has the same distribution as the untruncated size biased Yule process 
N{-). Thus there exists a limiting random variable Wn with density x ■ such that inequality 
(a) is satisfied with Zn{-) in place of N{-). Now note that for any t > Sn 



e-'Zn{t) = e-'N'-{t) + e 



-{t-Sn) 



Y{t 



For our desired asymptotics we can ignore the final — e * term, and write 



sup 

t>LOri 



N'"{t) 



Wn 



>e < 



sup 

t>UJ„ 



Zn{t) 



Wn 



>2' + 



y{t) £ 

e • sup > - 



Applying the maximal inequality to the martingale e ^Y{t) gives P ( e • supj>o > | 1 < 



Y{t) 



Ce-'e 



Combine with part (a) of the Lemma applied to Zn{-) to get the result. 



We now give a construction of the pruned percolation counting process N^'^{t), designed for 
comparison with a similar construction of the size-biased Yule process. Recall we are conditioning 
on existence of a "distinguished" path from 1 to a of length s„ -|- 6(a). Write = ?7o < t^i < 
U2 < ■ ■ ■ < Up^ < Sn for the distances from 1 to the vertices within this path. Define 



We can write 



Anit) := #{i > : U,< t}. 

N^-{t)=An{t)+Gn{t) 



(53) 
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where Gn{t) is the number of vertices in the pruned percolation counting process which are not on 
the distinguished path. By definition the process G„(-) evolves as the counting process satisfying 

¥{Gn{t + dt)-Gn{t) = l\Qn{t)) = n-^ {An{t) + Gn{t)) (n - P„ - #t - G„(i)) dt (54) 

because the number of vertices wetted at t equals An{t) + Gn{t) and the number of available 
vertices to be wetted equals n — — #t — Gn{t), the terms Pn and arising from the two 
restrictions in the definition of pruned. The filtration used here has ^n(O) as the cj-field generated 
by the ([/,) and then Qnit) = a(a„(0), G„(s), <s<t). 

To relate this construction to the size-biased Yule process, it does no harm to assume (by vari- 
ation distance convergence, Lemma [24|^c)) that P„ has exactly Poisson(s„) distribution, so that 
(C/i, . . . ,Up^) are the points of a rate-1 Poisson point process on (0, Now we could construc- 
tion the size-biased Yule process, cut at Sn, via 

N'"{t)=Ar,{t)+Cn{t) 

where Cn(-) evolves as the counting process satisfying 

F{Cn{t + dt)-Cnit) = l\g{t)) = {An{t)+Gnit)) dt (55) 

for appropriate filtration (^(t)). But it is more useful to couple the two processes by first defining 
N^"{-) via (53) and then defining iV'^"(-) via 

N'" (t) = An{t) + Gn{t) + Bn{t) (56) 

where Bn{-) evolves as the counting process with i?n(0) = and 

¥ {Bnit + dt) - Bn{t) = l\gn{t)) = {bn{t) + Bn{t)) dt (57) 



where (subtracting from ^) 6„(t) + Bn{t) = {An{t) + G„(t) + Bn{t)) - n"^ {An{t) + 
Gn{t)) {n — Pn — #t — Gn{t)), which works out as 

< bn{t) = {An{t) + Gn{t)) ^-^+#^+^4*) ^ 

In particular, Nn"{t) < N^"-(t), and Bn{-) is the number of extra vertices in the size-biased Yule 
process but not in the pruned percolation process. In view of Lemma 26 ^b), to prove Proposition 



25 it is sufficient to prove 



sup e ^Bn{t) — > in probability. (58) 

a;„<t<log n—LOn 



Note that we can write 



bnit) = tT^ Nj-{t) {N^-{t) + #t + #{i:t<Ui< s„}) 
< N'"{t) {N'-{t) +#t + #{i:t<Ui< s„}) 



and then Lemma [23[c) implies 

Ebn{t) < n-i(6e^* + 2e*(#t + s„)) := a„(t), say. (59) 

Consider the event 

0„:={5„(ilogn) =0} 
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that the two processes coincide up to time ilogn. Using (57) 



1 - < 



log n 



Ebn{t)dt < 



log n 



an{t)dt 



and so IP(iln) 1- Next observe that e ^Bn{t) is a submartingale, because 

d{e-^Bn{t)) = e~^dBn{t) - e~^Bn{t)dt 

and so 

W.{d{e-*Bn{t))\Gn{t)) = e-%n{t)dt > 0. 



Appeahng to the maximal inequahty for submartingales, to prove (58) it is now enough to 
prove 

Ee~*"5„(t„)l(17n) for t„ = logn - ujn- (60) 



Write fn{t) = EB„(t)l(J^„) for t > |logn, so that /„(^logn) = 0. Using (|57[, f'^{t) < + 
an{t) and so 



Using (59), for i > I logn we have an{t) < Ce^^ /n for some constant C. So (e ^ fn{t))' < Ce^ /n 
and then (60) holds because 

e"*"/n(tn) = i Ce^/n dt < Cn~^ exp(t„) ^ 0. 

: log n 



3.5 Proof of Proposition 22 



Proposition 25 studied the pruned percolation counting process starting from vertex 1. We 
now want to consider four such processes running concurrently, starting from vertices 1,2,3,4 
("sources"). In this setting, if one of the flows reaches a vertex j which was previously reached 
by a different flow, we say a collision occurs, and vertex j is only counted in the counting process 
{M\t) below) for the source i whose flow first reaches j. 



Recall the setting of Proposition 22 we say "conditional on si, 52,^1,^2" to mean conditional on 



the event described at the beginning of section 3.3 , Note that we condition only on the lengths and 



not on the internal structure of the four distinguished path segments. The values of si, S2, ii, ^2 
(which depend on n) are assumed to satisfy 



< := si + S2 + a <logn + B; LOn < t^ := ti + t2 + a < logn + B. (61) 



For i = 1,2,3,4 write Nn\t) for the number of vertices reached by the flow started at source i 
before time t, in the concurrent flow process. This process differs from the 4 separate processes in 
two ways. First, the elimination of collisions, as described above. Second, we extend the notion 
of (forbidden) short-cuts to say that a path of the percolation process may not meet any of the 
4 distinguished paths. Of course for each i, the number Nn\t) is bounded by the corresponding 
number in the percolation flow from i when the other flows are not present, which was the context 



of Proposition 25 



■ — (i) 

Proposition 27 Conditional on si, S2,ti,t2, there exist random variables Wn such that 
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where the limit r.v. 's are independent with density we and such that, for any uJn ^ tn ^ 
^{logn + B), 



e-^"N^'\tn) - Wj,'^ ^ in probability (62) 



for each 1 < i < 4. 



Proof. The proof involves only minor modifications of the proof of Proposition 25 - the essential 
issue is to show that the two changes (collisions; short cuts) in going from separate to concurrent 
processes has negligible effect. We omit details. ■ 



Recall the definition of 5*12, 5*34 and the definition of the Cox point processes from section [3^ Let 
SI2, (resp. S'34) be the times of the first collision (within the concurrent flow process) between 
the flow processes starting from 1 and 2 (resp. from 3 and 4). Note that if a collision occurs 
between the flow processes started at 1 and 2 at time t, then there is a path of length 2t from 1 
to 2, and (because we do not allow flow through the neighborhood Mri^n — l,n)) this path does 
not use the distinguished edge of the neighborhood. So 5*12 < 23^2 and 5*34 < 2S'34. This is an 
mequality because there might be shorter paths that were "pruned away" in the processes we 
have studied. So to prove Proposition [22] it is enough to prove the following 

Proposition 28 Conditional on si, S2,ti,t2, 

(25i*2 - log n, 2^3*4- log n) S (CP,ef) (63) 

as n ^ 00. 

Proof. Condition on the concurrent flow process until time t, and suppose the flows from source 
1 and source 2 have not collided before time t. Then the instantaneous conditional probability- 
per- unit-time of a collision ( "hazard rate" ) equals 

^ 2Ni'\t) N^jt) 
" n 

because the unseen length of each possible edge has Exponential 1/n) distribution. We are 
interested in the recentered process 25^2 ~ log ^- This process has hazard rate 

T,, un^H ^ s ivP(^logn+ig) jv(^)(llogn+i.) 

An(s = iA„(^s+ ^logn = e ^1 r, (64 

Now use Proposition [27] to conclude 

e~'Xn{s) ^ W'^^^W^'^^ uniformly on 2cj„ - logn < s < 

This easily implies 1S\2 — logn because is defined to have hazard rate e^VF^^^PF*^^) 



on —00 < s < 00. The joint convergence (63) follows by the same argument, the independence 



of the limits (VF'-*-') in Proposition 27 implying independence of the limits here. 



4 Further discussion 



4.1 Analysis of the limit function G[z) 

Recall 

(•oo 

G{z) = / P(VFiVF2e-" > z) du 
Jo 
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where Wi and W2 are independent Exponential(l). The Melhn transform of G is 

/>oo 

$(y) = / zy-^Giz)dz 
Jo 

/•oo /■CO roo / \ 

= / / / exp(—w)zy~^exp\ I dzdudw 

Jo Jo Jo \ w J 

poo poo 

= T{y) / e""^ / e~'"wydwdu 
Jo Jo 

POD 

= r{y)T{y + 1) / e-'^ydu 
Jo 

r{y)r{y + i) 
y 

= imf- 

Checking a table of Melhn transforms ([25] II. 5. 34) we see 

G{z) = 2Ko{2z^^^) 

where Kq is the modified Bessel function of the second kind. The standard asymptotics of Kq 
([7J 4.12.6) say 

nr 

Kq(x) ~ a / — exp (—x) as ^ 00 
V 2x 

and so 

G(z) ~7ri/2z-i/^exp(-2V^) as 2; — > cxD. 
4.2 Methodology of relating local and global structure 



As illustrated by the heuristic argument in section 1.3 the conceptual point of Theorem [T] is 



that a quantity depending on the "global" structure of the network can be studied statistically 
via a "local" (i.e. large fixed distance) calculation. This reduction to local structure is, to 
our understanding, the central point in the powerful non-rigorous cavity method of statistical 
physics [19] . In our attempted mathematical reformulations of the cavity method as applied to 
combinatorial optimization problems such as TSP [3 El S] in this random network model Qn, 



we made explicit use of the n — > 00 limit structure (the PWIT of section 2.3) of this model as 
viewed from a random vertex. In these harder problems one needs rather abstract, often as yet 
not rigorously justified, arguments to connect local and global structure. The problem in this 
paper seems conceptually easier in that we can use concrete calculations instead. 



4.3 Flows through vertices 

In the setting of Theorem [l] one could alternatively consider flows through vertices instead of 
edges. Let us state this alternative result and indicate the derivation of the limit distribution 
without giving details of proof. 
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Figure 4: The function G{z), drawn using Mathematica's numerical integration toolbox. 

Write F*{v) for the flow through vertex v £ [n]. Let (Wi,i > 1) be independent Exponential(l) 
r.v.'s and let < < ^2 < • • • be the points of a Poisson (rate 1) process on (0, 00). Define 



Corollary 29 In the setting of Theorem [IJ as n —> 00 for fixed z > 0, 

l#{v : F:{v) < zlogn} -^l, F(H < z). 

The formula is most neatly derived using the n — > 00 limit PWIT structure of ^„ [HI . Relative to 
a typical vertex v of the PWIT, the edge-lengths to adjacent vertices (vi) are distributed as 
the points of a Poisson (rate 1) process on (0, 00). For each Vi let Ni{t) be the number of vertices 
within distance t from Vi using paths not via v. Then e~^Ni{t) Wi for i.i.d. Exponential(l) 
r.v.'s iyVi). The relative volume of flow through the two edges Vi ^ v ^ vj will then be 
WiWj exp{—£^i — ^j) by the argument for Theorem [Tj 

4.4 Different models of random networks 

The heuristic argument of section [T3] can be carried over to a variety of random networks models. 
For example, fix a degree distribution P(A = i), « > 1 with finite 2 + e moment. There are 
several ways (e.g. the "configuration model") to formalize the idea of a n- vertex graph which 
is random subject to the constraint that the n — > 00 asymptotic degree distribution is A. Such 
models have local weak limits which are simple branching processes; looking outwards from a 
typical edge e, each end- vertex is the founder of a Galton- Watson branching process with offspring 
distribution 

P{A* = i) = {i + 1)P(A = i + 1)/EA. 
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Thus we expect the average vertex-vertex distance Dn in such a random graph to behave as 

log n 



log EA* ' 



See [23] for proofs for several models. Now make a random network by assigning independent 
random lengths r/g to edges e (note that here we do not scale edge-lengths with n). Then the 
Galton- Watson process above becomes a general Markov branching process in which individuals 
have A* offspring at independent ages (r/j); the population size process Noo{t) has some Malthu- 
sian growth constant 6 and some a.s. limit exp(— 0t)A^oo(i) Z. The heuristic argument from 



section 1.3 now suggests that the limit joint distribution of edge-lengths and relative edge- flows 
will be 

(?7, ZiZ2exp(-6'r?)) 

where (77,^1,^2) are independent. But giving a rigorous proof for the models in [23] may be 
technically challenging. 

Finally, one might consider models on the two-dimensional lattice with i.i.d. random edge- 
lengths. Here, studying lengths of shortest routes is tantamount to studying (unoriented) first 
passage percolation [23]. However, if i is close to i' and j is close to / then we expect the routes 
from i to j and from i' to j' to coincide except near the endpoints. This suggests a quite different 
distribution of edge-flows, more specifically that F{e) should have a power-law tail. 



4.5 Random demands 



A small variation of our model is to assume that the total flow to be routed from vertex i to 
vertex j is a random variable Dij/n instead of 1/n; the flow is still routed along the same shortest 
path as in the uniform demand case. So the flow across edge e is 

^n(e) = -V V A,l{eG7r(i,j)}. 

ie[n] je[n],j=/=i 

Because the flow across an edge e is made up from many different source-destination pairs, it is 
straightforward to add a "law of large numbers" step to the proof of Theorem [T] and obtain the 
following corollary. 



Corollary 30 (a) Suppose Dij > are independent with common mean < /x < 00 and with 
uniformly bounded second moments. Then 

1 

-#{e: Fn{e) > zfilogn} G{z), z > 0. 
n 

(h) Suppose instead the gravitational model Dij = DiDj where Di >0 are independent random 
variables with common mean < fi < 00 and with uniformly bounded second moments. Then 

-#{e : Fn{e) > z^l^ log n} G{z), z > 0. 



4.6 Joint distributions for shortest paths 



As described in section 2.1 various aspects of shortest paths in the model Qn have been studied. 



The following ideas will be developed elsewhere. There is a known (implicitly, at least) limit 



36 



distribution 

L>„(l,2)-logn ^ D(l,2) 
for distance between a typical pair of vertices. Now fix A; > 3. We expect a joint limit 

(Dn(l,2)-logn,...,A.(l,fc)-logn) 4 2), . . . , L>(1, A:)) (65) 

and it turns out the limit distribution is 

(L»(l,2),...,D(l,fc)) i (6 + r/i2,...,Ci + ??ife) 
where has the double exponential distribution 

<x) = exp(-e~^), -oo < x < oo 

the rjij have logistic distribution 

F{rj <x) = -oo < a; < oo 

and (here and below) the r.v.'s in the limits are independent. Now we can go one step further: 
we expect a joint limit for the array 

{Dn{i,j) -logn,l<i<j <k) S iD{i,j), l<i<j<k) 

and the joint distribution of the limit is 

iD{i,j),l<i<j <k) = i^i + ^j-^ij,l<i<j<k) 

where the limit r.v.'s all have the double exponential distribution. This implies two representa- 
tions for the original limit distribution: 

D{i,2) I Ci + m2 = 6 + 6-62. 
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